Matching tutor to student: rules and mechanisms for efficient two-stage learning in neural circuits
نویسندگان
چکیده
Existing models of birdsong learning assume that brain area LMAN introduces variability into song for trial-and-error learning. Recent data suggest that LMAN also encodes a corrective bias driving short-term improvements in song. These later consolidate in area RA, a motor cortex analogue downstream of LMAN. We develop a new model of such two-stage learning. Using a stochastic gradient descent approach, we derive how ‘tutor’ circuits should match plasticity mechanisms in ‘student’ circuits for efficient learning. We further describe a reinforcement learning framework with which the tutor can build its teaching signal. We show that mismatching the tutor signal and plasticity mechanism can impair or abolish learning. Applied to birdsong, our results predict the temporal structure of the corrective bias from LMAN given a plasticity rule in RA. Our framework can be applied predictively to other paired brain areas showing two-stage learning.
منابع مشابه
Rules and mechanisms for efficient two-stage learning in neural circuits
Trial-and-error learning requires evaluating variable actions and reinforcing successful variants. In songbirds, vocal exploration is induced by LMAN, the output of a basal ganglia-related circuit that also contributes a corrective bias to the vocal output. This bias is gradually consolidated in RA, a motor cortex analogue downstream of LMAN. We develop a new model of such two-stage learning. U...
متن کاملSteps A Simulated Tutorable Physics Student
This paper describes a prototype of a simulated student that learns by interacting with a human tutor The system solves physics problems while showing its work on a workstation screen and the tutor can intervene at certain points during problem solving to advise the simulated student In particular the tutor can cross out incorrect actions and or enter correct actions These interactions cause th...
متن کاملEarly and late consolidation and reconsolidation of memory in the prelimbic cortex
Rats can learn to forage among olfactory cues to associate one with reward in only 3 massed trials. The learning is achieved in less than 10 min and results in a memory trace lasting at least 1wk week. To study the neuro-anatomical circuits involved in the memory formation we used immunoreactivity to the immediate early gene c-fos as a marker for neuronal activity induced by the learning. The p...
متن کاملEarly and late consolidation and reconsolidation of memory in the prelimbic cortex
Rats can learn to forage among olfactory cues to associate one with reward in only 3 massed trials. The learning is achieved in less than 10 min and results in a memory trace lasting at least 1wk week. To study the neuro-anatomical circuits involved in the memory formation we used immunoreactivity to the immediate early gene c-fos as a marker for neuronal activity induced by the learning. The p...
متن کاملNeural Circuits Trained with Standard Reinforcement Learning Can Accumulate Probabilistic Information during Decision Making
Much experimental evidence suggests that during decision making, neural circuits accumulate evidence supporting alternative options. A computational model well describing this accumulation for choices between two options assumes that the brain integrates the log ratios of the likelihoods of the sensory inputs given the two options. Several models have been proposed for how neural circuits can l...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016